Methods and apparatuses related to text caption error correction

ABSTRACT

Systems and methods related to providing error correction in a text caption are disclosed. A method may comprise displaying a text caption including one or more blocks of text on each of a first device and a second device remote from the first device. The method may also include generating another block of text and replacing a block of text of the text caption with the another block of text. Furthermore, the method may include displaying the text caption on the second device having the block of text of the first text caption replaced by the another block of text.

CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation of U.S. patent application Ser. No. 13/768,918, filed Feb. 15, 2013, pending, which application is a continuation of U.S. patent application Ser. No. 12/624,973, filed Nov. 24, 2009, now U.S. Pat. No. 8,379,801, issued Feb. 19, 2013, the disclosure of each of which is hereby incorporated herein by this reference in its entirety.

TECHNICAL FIELD

Embodiments of the disclosure relate generally to text captioning and more specifically to correction of errors within a text caption.

BACKGROUND

Modern telecommunication services provide features to assist those who are deaf or hearing-impaired. One such feature is a telecommunication device for the deaf (TDD). Hearing-capable users communicate with hearing-impaired users who are users of TDD devices through so-called “relays.” A relay is a telecommunication intermediary service, which is intended to permit a deaf or a hearing-impaired person to utilize a normal telephone network. A relay service may include an operator, referred to as a “call assistant,” who serves as a human intermediary between a hearing user and a hearing-impaired user. The call assistant communicates with the hearing-impaired user using a TDD and communicates with the hearing user by voice over a standard telephone line.

A text captioned telephone system employs a relay service in a mode where the relay transmits both the voice of the hearing user and a text stream of the words spoken by the hearing user. A hearing-impaired user using a text captioned telephone, or telephone enabled to do text enhanced telephony, may carry on a normal telephone conversation with a hearing user while a text transcription of the words spoken by the hearing user is displayed on the text captioned telephone. The text transcription may allow the hearing-impaired user to confirm his or her understanding of the words spoken by the hearing user.

More specifically, during a communication session, a call assistant may listen to the voice signal of a hearing user and “revoice” the words to a speech recognition computer program tuned to that call assistant's voice. A text transcription output from the speech recognition computer is then transmitted to the text captioned telephone being used by the hearing-impaired user. Even with revoicing provided by a trained call assistant, the text transcription received by the hearing-impaired user may include errors. Therefore, correction of the errors within the text transcription may be required.

According to various conventional methods, errors within a text caption are corrected by either backspacing an error in a text caption and displaying corrected text or providing a corrected portion (e.g., a word or a sentence) at the end of a previously provided text caption. Although, backspacing an error in a text caption and displaying corrected text may provide a hearing-impaired user with a context for the correction, this method is distracting to a hearing-impaired user and interrupts the continuity of a conversation between a hearing-impaired user and a hearing user. Further, providing a corrected portion at the end of a previously provided text caption not only interrupts the continuity of a conversation but also fails to provide context of the correction to the hearing-impaired user. Therefore, a hearing-impaired user must determine where the corrected text should be inserted into the previously provided text caption.

A need exists to improve text correction of a text captioning system. Specifically, there is a need for methods of providing text caption correction while providing a user with context of a correction and without distracting the user or interrupting the continuity of a conversation between a hearing-impaired user and a hearing user.

BRIEF SUMMARY

In one embodiment of the disclosure, a method of providing error correction in a text caption is disclosed. The method may comprise displaying a text caption including one or more blocks of text on each of a first device and a second device remote from the first device. The method may also include generating another block of text and replacing a block of text of the text caption with the another block of text. Furthermore, the method may include displaying the text caption on the second device having the block of text of the first text caption replaced by the another block of text.

In another embodiment of the disclosure, a communication system is provided. The communication system may include a plurality of devices, wherein each device of the plurality includes a processor and a computer-readable medium coupled to the processor. The communication system may further include a plurality of application programs, wherein each application program is stored in an associated computer-readable medium. When executed by the processor, one or more application programs are configured to display a text caption including one or more blocks of text on a display device of each of a first device of the plurality of devices and a second device of the plurality of devices. One or more application programs may be configured to generate another block of text and replace one block of text of the text caption with the another block of text. Furthermore, one or more application programs may be configured to display the corrected text caption on the display device of the second device, wherein the at least one block of text of the text caption is replaced by the another block of text.

Another embodiment of the disclosure may include a computer-readable media storage medium storing instructions that when executed by a processor cause the processor to perform instructions in accordance with one or more embodiments of the disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a hearing-impaired communication system, in accordance with an embodiment of the disclosure;

FIG. 2 is a block diagram of the communication system of FIG. 1, according to an embodiment of the disclosure;

FIG. 3 illustrates a text caption including errors;

FIG. 4 illustrates a corrected block of text;

FIG. 5 illustrates a corrected text caption including corrected text and further including identified corrected words, in accordance with an embodiment of the disclosure; and

FIG. 6 is a flow chart illustrating a method of providing error correction within a text caption, according to an embodiment of the disclosure.

DETAILED DESCRIPTION

In the following detailed description, reference is made to the accompanying drawings which form a part hereof and, in which is shown by way of illustration, specific embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable those of ordinary skill in the art to practice the invention, and it is to be understood that other embodiments may be utilized, and that structural, logical, and electrical changes may be made within the scope of the disclosure.

In this description, functions may be shown in block diagram form in order not to obscure the disclosure in unnecessary detail. Furthermore, specific implementations shown and described are only examples and should not be construed as the only way to implement the disclosure unless specified otherwise herein. Block definitions and partitioning of logic between various blocks represent a specific implementation. It will be readily apparent to one of ordinary skill in the art that the various embodiments of the disclosure may be practiced by numerous other partitioning solutions. For the most part, details concerning timing considerations, and the like, have been omitted where such details are not necessary to obtain a complete understanding of the disclosure in its various embodiments and are within the abilities of persons of ordinary skill in the relevant art.

Referring in general to the following description and accompanying drawings, various aspects of the disclosure are illustrated to show its structure and method of operation. Common elements of the illustrated embodiments are designated with like numerals. It should be understood the figures presented are not meant to be illustrative of actual views of any particular portion of the actual structure or method, but are merely idealized representations which are employed to more clearly and fully depict the disclosure.

When executed as firmware or software, the instructions for performing the methods and processes described herein may be stored on a computer readable medium. A computer readable medium includes, but is not limited to, magnetic and optical storage devices such as disk drives, magnetic tape, CDs (compact disks), DVDs (digital versatile discs or digital video discs), and semiconductor devices such as RAM, DRAM, ROM, EPROM, and Flash memory.

FIG. 1 illustrates a communication system 100 configured to facilitate a communication session between a hearing-impaired user and a hearing-capable user, in accordance with one or more embodiments of the disclosure. Communication system 100 may include communication device 120, communication device 190, and a relay service 110. Communication device 190 may be coupled to communication device 120 via a network 180 and communication device 120 may be coupled to relay service 110 via a network 170. For example only, network 170 and network 180 may each be implemented according to the standards and bandwidth requirements of a telephone network such as the Public Switch Telephone Network (PSTN). The use of the term “telephone network” as used herein also contemplates networks that are compatible and configured to provide communications using digital standards, an example of which includes Voice Over Internet Protocol (VOIP). By way of example and not limitation, enhanced bandwidth requirements of various communication devices may be better accommodated by providing high bandwidth networks and connections thereto through various enhanced-bandwidth networking technology including DSL, Cable, Ethernet, etc.

As described more fully below, relay service 110 may be configured to provide interpretive services to hearing-impaired user 140. More specifically, a human “call assistant” within relay service 110 may be employed to facilitate a communication session between a hearing-impaired user 140 and a hearing-capable user 160. By way of example only, communication device 190 may comprise a conventional voice phone. As such, hearing-capable user 160 may interact in a conventional manner with communication device 120 through the use of a voice-based dialogue conveyed over communication device 190. The voice of hearing-capable user 160 may be conveyed over communication device 190 and may be transmitted over network 180 to communication device 120. Furthermore, the voice conveyed over communication device 190 may be transmitted through communication device 120, over network 170, and to relay service 110.

By way of example, communication device 120 may include a captioned telephone, a telephone enabled for text enhanced telephony, or any other suitable communication device configured to receive and display text. Hearing-impaired user 140 may interact in a conventional manner with communication device 190 through the use of a voice-based dialogue conveyed over communication device 120. Furtherer more, as described more fully below, communication device 120 may be configured to receive and display a text transcription of a voice signal sent from relay service 110 via network 170.

In various embodiments of the disclosure, instructions implementing an “application program” may be tangibly embodied in a computer-readable medium which may include one or more fixed or removable data storage devices, such as a zip drive, floppy disc drive, hard drive, CD-ROM drive, tape drive, flash memory device, etc. Further, an application program may include instructions that, when read and executed by a processor, may cause the processor to perform the steps necessary to implement and/or use embodiments of the disclosure. An application program and/or operating instructions may also be tangibly embodied in a memory and/or data communications devices, thereby making a computer program product or article of manufacture according to an embodiment the disclosure. As such, the term “application program” as used herein is intended to encompass a computer program accessible from any computer readable device or media. Furthermore, portions of an application program may be distributed such that some of the application program may be included on a computer readable media within a processor-based device (e.g., device 102 or device 152) and some of the application program may be included in a removable data storage device.

FIG. 2 illustrates a block diagram of communication system 100. As illustrated, relay service 110 may include a display device 132 coupled to a processor-based device 102 comprising a processor 104 and a memory 106, such as random access memory (RAM). Device 102 may also implement a compiler (not shown) that allows an application program 108 written in a programming language to be translated into processor 104 readable code. Application program 108 may be configured to access and manipulate data stored in memory 106 of device 102 using relationships and logic that are generated using the compiler.

Communication device 120 may include a display device 134 and a processor-based device 152 comprising a processor 154 and a memory 156, such as random access memory (RAM). Device 152 may also implement a compiler (not shown) that allows an application program 118 to be translated into processor 104 readable code. Application program 118 may be configured to access and manipulate data stored in memory 156 of device 152 using relationships and logic that are generated using the compiler.

During a contemplated operation of communication system 100, hearing-capable user 160 (see FIG. 1) may convey speech into communication device 190, which may then transmit an associated voice signal to communication device 120 over network 180. Communication device 120 may then transmit the voice signal to relay service 110. Upon receipt of a voice signal, a human call assistant (not shown) positioned within relay service 110, may listen to the voice signal transmitted from communication device 120 and “revoice” the words to a speech recognition computer program (not shown) within relay service 110. As will be understood by one having ordinary skill in the art, revoicing is an operation in which the call assistant repeats the words she or he hears upon receipt of the voice signal. The speech recognition program may then output a text transcription of the call assistant's spoken words and, thereafter, the text transcription may be transmitted to communication device 120 via network 170. Hereinafter, a text transcription output from a speed recognition program may also be commonly referred to as a “block” of text. For example, a block of text may comprise one or more words, one or more sentences, one or more lines of text, or any combination thereof.

Hearing-impaired user 140 (see FIG. 1) positioned proximate communication device 120 may receive, via network 180, the voice signal transmitted from communication device 190. Furthermore, a block of a text output from the speech recognition program and associated with the transmitted voice signal may be displayed within a text caption on display device 134 as it is sent from relay service 110 and received by communication device 120. As a result, hearing-impaired user 140 may listen to the voice signal transmitted from communication device 190 and, immediately thereafter, receive a block of text associated with the voice signal. In order to enhance the effectiveness of communication system 100, it is desirable for hearing-impaired user 140 to either receive the block of text as close to real-time as possible, or as close as possible to simultaneous receipt of the voice signal. For example only, hearing-impaired user 140 may listen to the voice signal transmitted from communication device 190 and, within three second or less, receive a block of text associated with the voice signal.

Furthermore, it should be noted that a block of text, as output from the speech recognition program and as transmitted to and displayed within a text caption on display device 134, may also be displayed within a text caption displayed on display device 132. As a result, the call assistant may view the text caption, including one or more blocks of text, as displayed on display device 134 and as viewed by hearing-impaired user 140. Therefore, any errors that may exist in the text caption displayed on display device 134 may also be visible to the communication assistant on display device 132.

In accordance with various embodiments of the disclosure, application program 108 may be configured to provide in-line correction of any errors that may be detected within a text caption displayed on display device 134. Stated another way, application program 108 may be configured to replace any detected errors within a displayed text caption with corrected text. More specifically, in the event the call assistant notices one or more errors within a block of text of the text caption as displayed on each of display device 134 and display device 132, the call assistant may edit the block of text including the one or more errors through input into device 102 to correct the one or more errors. For example only, the call assistant may edit a block of text through input into a keyboard (e.g., selecting and replacing text, inserting text, and/or deleting text). As a result, a corrected block of text including one or more corrected words may be generated. Thereafter, the corrected block of text including the one or more corrected words may be sent to communication device 120. Upon receipt of the corrected block of text at communication device 120, application program 118 may be configured to replace the block of text including the one or more errors with the associated corrected block of text. Furthermore, substantially simultaneously upon replacing the block of text including the one or more errors with the associated corrected block of text, application program 118 may be configured to display the corrected text caption on display device 134.

Furthermore, in order to make a correction more obvious to a hearing-impaired user reading the text caption, application program 108 may further be configured to identify each corrected word within the text caption with an identifier (e.g., a mark or a tag). By way of example and not by way of limitation, application program 108 may be configured to identify each corrected word by highlighting each corrected word displayed in a text caption on display device 134.

With reference to FIG. 3, a text caption 310 including a plurality of errors is depicted. Furthermore, as illustrated, text caption 310 includes blocks of text 312, 314, and 316. For example, each block of text 312, 314, and 316 may be generated by a speech recognition program and subsequently transmitted from relay service 110 and received by and displayed within text caption 310 displayed on display device 134 (see FIG. 2). Furthermore, text caption 310 may be displayed on display device 132 within relay service 110 (see FIG. 2) visible to a call assistant. In the event the call assistant notices one or more errors within text caption 310, the call assistant may edit the text displayed on display device 132 through input into device 102 to correct the one or more errors and, as a result, may generate a corrected block of text including one or more corrected words. For example, with reference to FIGS. 3 and 4, a call assistant may recognize one or more errors within block 314 and, thereafter, generate a corrected block of text 414 including one or more corrected words 414 and 416. Thereafter, corrected block of text 414 may be transmitted from relay service 110 and received by communication device 120. Block of text 314 may then be replaced by corrected block of text 414, as illustrated in text caption 510 (see FIG. 5). Further, as illustrated in FIG. 5, application program 118 (see FIG. 2) may be configured to display corrected words (i.e., words 414 and 416) within text caption 510 with a marking (i.e., a tag), such as a highlight 520.

It should be noted that a block of text including one or more corrected words may be sent at any suitable time after the call assistant has identified one or more errors and made associated corrections. For example, the call assistant may notice one or more errors in the most recently sent block of text, correct the one or more errors, and transmit a corrected block of text including one or more corrected words prior to sending another block of text. On the other hand, the call assistant may not notice one or more errors in a block of text until after one or more blocks have subsequently been sent. In any event, any block of text displayed on a display device 134 (see FIG. 2) may be replaced with a corrected block of text including one or more corrected words.

FIG. 6 is a flow chart illustrating a method 600 of correcting one or more textual errors within a text caption, in accordance with one or more embodiments of the disclosure. Method 600 may include displaying a text caption including one or more blocks of text on each of a first device and a second device 612. Moreover, method 600 may include identifying one or more errors within a block of text within text caption 602 and generating a new block of text having corrected word for each identified error 604. Method 600 may further include replacing the block of text having the one or more errors with the new block of text in the text caption 606. In addition, method 600 may include displaying the text caption on the second device wherein the block of text having the one or more errors is replaced by the new block of text 608. In addition, method 600 may include tagging each corrected word displayed within the text caption 610.

As described above in various embodiments of the disclosure, an error detected in a text caption may be replaced with corrected text (i.e., “in-line” correction) and, therefore, in comparison to conventional text caption correction methods, a user viewing the text caption may be provided with a context of the error correction. Stated another way, upon completion of an error correction within a text caption, a reader may understand how the correction relates to the text caption as a whole and will not be required to guess as to where the correction belongs within the text caption or how the correction applies to the text caption. Furthermore, in comparison to conventional text caption correction methods, in-line correction of a text caption may reduce distractions to a hearing-impaired user and, therefore, interruptions in the flow of conversation between a hearing-capable user and a hearing-impaired user may be reduced.

While the disclosure has been described herein with respect to certain preferred embodiments, those of ordinary skill in the art will recognize and appreciate that it is not so limited. Rather, many additions, deletions, and modifications to the preferred embodiments may be made without departing from the scope of the invention as hereinafter claimed. In addition, features from one embodiment may be combined with features of another embodiment while still being encompassed within the scope of the invention as contemplated by the inventors. 

What is claimed is:
 1. A communication system including: a first communication device specifically configured for use by a call assistant of a remote captioning service providing captioning assistance for a hearing-impaired user during a real-time communication session; and a second communication device specifically configured for use by the hearing-impaired user to provide captions displayed to the hearing-impaired user during the real-time communication session; wherein the first communication device comprises: a first memory device having a speech recognition program stored therein; a first input device configured to receive inputs from the captioning assistant; a first processor operably coupled with the first memory device and the first input device, the first processor configured to: receive a voice signal during a real-time communication session between at least two parties, the voice signal including at least audio from a far end user for the real-time communication session; generate a text transcription for the audio for the far-end user from the voice signal during the real-time communication session using the speech recognition program; transmit a first block of text of the text transcription to the second communication device for display by the second communication device during the real-time communication session; receive the inputs from the call assistant as edits to the text transcription; and transmit a replacement block of text with the edits to the second communication device after transmission of the first block to the second communication device has already occurred, the replacement block of text being an inline correction for the first block of text that was already received and displayed by the second communication device; and wherein the second communication device comprises: second electronic display; and second processor operably coupled with the second electronic display, the second processor configured to: receive the voice signal and during the real-time communication session; receive the first block of text of the text transcription from the remote captioning service; cause the first block of text of the text transcription to be displayed by the second electronic display as captions for the hearing-impaired user during the real-time communication session; receive the replacement block of text from the remote captioning service after the first block of text has been received and displayed by the second electronic display; and cause the replacement block of text to be displayed by the second electronic as an inline correction for the first block of text previously displayed by the second communication device.
 2. The communication system of claim 1, wherein the first processor is configured to transmit the first block of text and the replacement block of text to the second communication device over a telephone network selected from the group consisting of the Public Switch Telephone Network and a VOIP network.
 3. The communication system of claim 1, wherein the replacement block of text is selected from the group consisting of at least one word, at least one sentence, and at least one line of text.
 4. The communication system of claim 1, wherein the first processor is programmed to generate the text transcription using a speech recognition program with assistance of a call assistant revoicing words of the voice signal.
 5. The communication system of claim 1, further comprising a first display device operably coupled with the first processor, the first display device configured to receive the text transcription from the first processor and display the text transcription visible for the call assistant to review the text transcription for identifying errors.
 6. The communication system of claim 1, wherein the first input device includes a keyboard.
 7. The communication system of claim 4, wherein the first processor is configured to insert the replacement block of text as an inline correction as displayed by the first display device associated with the call assistant responsive to the call assistant selecting and replacing text, inserting text, or deleting text.
 8. The communication device of claim 1, wherein the second processor is further configured to transmit the voice signal to the remote captioning service prior to the voice signal having the text transcription thereof being generated by the remote relay service.
 9. The communication device of claim 1, wherein the second processor is configured to cause the second electronic display to provide a visual indication that the replacement block of text has replaced the first block of text as displayed by the second electronic display.
 10. The communication device of claim 9, wherein the visual indication includes the replacement block of text being displayed with highlighting.
 11. The communication system of claim 1, further comprising a first application program stored in the first memory that, when executed by the first processor, is configured to generate and transmit the replacement block of text.
 12. The communication system of claim 1, wherein the second communication device further comprises a second memory operably coupled with the second processor, the second communication device further comprising a second application program stored in the second memory that, when executed by the second processor, is configured to cause the replacement block of text to be displayed by the second electronic as the inline correction for the first block of text previously displayed by the second communication device.
 13. A method of providing error correction in a caption-based communication system, the method comprising: receiving, at a first communication device associated with a call assistant within a captioning service, a voice signal during a real-time communication session between a second communication device associated with a hearing-impaired user and a third communication device; receiving, at a second communication device, the voice signal from the first communication device within the remote communication device during the real-time communication session; generating, at the first communication device, a text transcription for the voice signal during the real-time communication session using a speech recognition program; transmitting, from the first communication device to the second communication device, a first block of text of the text transcription; receiving, at the second communication device, the first block of text of the text transcription of the voice signal from the remote captioning service; displaying the first block of text on a second electronic display of the second communication device during the real-time communication session; receiving, through a first input device of the first communication device, corrections corresponding to an error within at least a portion of a text transcription after of the text transcription to the second communication device has already occurred; generating, at the first communication device, a replacement block of text responsive to the corrections; transmitting the replacement block of text from the first communication device to the second communication device as an inline correction for the error with instructions for the second communication device to indicate that the block of text is a correction for the portion of the text transcription to be replaced; receiving, at the second communication device, the replacement block of text from the remote captioning service after the first block of text has been received and displayed by the second electronic display; and displaying the replacement block of text on the second electronic display as an inline correction for the first block of text previously displayed by the second communication device.
 14. The method of claim 13, wherein receiving corrections into the first input device includes at least one of replacing text, inserting text, or deleting text.
 15. The method of claim 13, further comprising displaying the replacement block of text as an inline correction on a first electronic display of the first communication device to be visible to the call assistant during the real-time communication session.
 16. The method of claim 13, wherein displaying the replacement block of text includes displaying a visual indication on the second electronic display that the replacement block of text has replaced the first block of text as displayed by the second electronic display.
 17. The method of claim 16, wherein the visual indication includes at least one of a mark or a tag of the replacement block of text as displayed by the second electronic display.
 18. The method of claim 13, further comprising transmitting the voice signal from the first communication device to the remote captioning service prior to the text transcription being generated for that portion of the voice signal.
 19. The method of claim 13, wherein receiving, at the second communication device, the replacement block of text from the remote captioning service occurs within three seconds or less from when the voice signal from the first communication device is received by the second communication device.
 20. The method of claim 13, wherein generating, at the first communication device, a replacement block of text responsive to the corrections occurs through an application program stored in a first memory of the first communication device as executed by a first processor of the first communication device. 